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Abstract 

In this two-parts paper we propose a decentralized strategy, based on a game-theoretic for- 
mulation, to find out the optimal precoding/multiplexing matrices for a multipoint-to- multipoint 
communication system composed of a set of wideband links sharing the same physical resources, i.e., 
time and bandwidth. We assume, as optimality criterion, the achievement of a Nash equilibrium 
and consider two alternative optimization problems: 1) the competitive maximization of mutual 
information on each link, given constraints on the transmit power and on the spectral mask imposed 
by the radio spectrum regulatory bodies; and 2) the competitive maximization of the transmission 
rate, using finite order constellations, under the same constraints as above, plus a constraint on the 
average error probability. In Part I of the paper, we start by showing that the solution set of both 
noncooperative games is always nonempty and contains only pure strategies. Then, we prove that the 
optimal precoding/multiplexing scheme for both games leads to a channel diagonalizing structure, so 
that both matrix-valued problems can be recast in a simpler unified vector power control game, with 
no performance penalty. Thus, we study this simpler game and derive sufficient conditions ensuring 
the uniqueness of the Nash equilibrium. Interestingly, although derived under stronger constraints, 
incorporating for example spectral mask constraints, our uniqueness conditions have broader valid- 
ity than previously known conditions. Finally, we assess the goodness of the proposed decentralized 
strategy by comparing its performance with the performance of a Pareto-optimal centralized scheme. 
To reach the Nash equilibria of the game, in Part II, we propose alternative distributed algorithms, 
along with their convergence conditions. 

1 Introduction and Motivation 

In this two-parts paper, we address the problem of finding the optimal precoding/multiplexing strategy 
for a multiuser system composed of a set of Q noncooperative wideband links, sharing the same physical 
resources, e.g., time and bandwidth. No multiplexing strategy is imposed a priori so that, in principle, 
each user interferes with each other. Moreover, to avoid excessive signaling and the need of coordination 
among users, we assume that encoding/decoding on each link is performed independently of the other 
links. Furthermore, no interference cancellation techniques are used and thus multiuser interference 
is treated as additive, albeit colored, noise. We consider block transmissions, as a general framework 
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encompassing most current schemes hke, e.g., CDMA or OFDM systems (it is also a capacity- lossless 
strategy for sufficiently large block length [HE])- Thus, each source transmits a coded vector 



— FqrSg, (1) 

where Sg is the x 1 information symbol vector and Fg is the N x N precoding matrix. Denoting with 
Hrq the channel matrix between source r and destination q, the sampled baseband block received by 
the q-th destination is (dropping the block indexjl] 

Q 

Yq = HqgXg + ^ Hrq^r + Wg, (2) 

rj^q=l 

where Wg is a zero-mean circularly symmetric complex Gaussian white noise vector with covariance 
matrix The second term on the right-hand side of ([2]) represents the Multi-User Interference 

(MUI) received by the q-th destination and caused by the other active links. Treating MUI as additive 
noise, the estimated symbol vector at the q-th receiver is 

Sg = D [Gf y,] , (3) 

where is the N x N receive matrix (linear equalizer) and D[-] denotes the decision operator that 
decides which symbol vector has been transmitted. 

The above system model is sufficiently general to incorporate many cases of practical interest, such 
as: i) digital subscriber lines, where the matrices (F^)^]^ incorporate DFT precoding and power alloca- 
tion, whereas the MUI is mainly caused by near-end cross talk ii) cellular radio, where the matrices 
(Fg)^^ contain the user codes within a given cell, whereas the MUI is essentially intercell interfer- 
ence [1]; iii) ad hoc wireless networks, where there is no central unit assigning the coding/multiplexing 
strategy to the users [5]. The I/O model in ([2]) is particularly appropriate for studying cognitive radio 
systems [6] , where each user is allowed to re- use portions of the already assigned spectrum in an adap- 
tive way, depending on the interference generated by other users. Many recent works have shown that 
considerable performance gain can be achieved by exploiting some kind of information at the trans- 
mitter side, either in single-user [2], [7]- [9] or in multiple access or broadcast scenarios (see, e.g. [10]). 
Here, we extend this idea to the system described above assuming that each destination has perfect 
knowledge of the channel from its source (but not of the channels from the interfering sources) and of 
the interference covariance matrix. 

Within this setup, the system design consists on finding the optimal matrix set (Fg, Gq)'^^^ accord- 
ing to some performance measure. In this paper we focus on the following two optimization problems: 
P.l) the maximization of mutual information on each link, given constraints on the transmit power 
and on the spectral radiation mask; and P.2) the maximization of the transmission rate on each link, 
using finite order constellations, under the same constraints as above plus a constraint on the average 
(uncoded) error probability. The spectral mask constraints are useful to impose radiation limits over 
licensed bands, where it is possible to transmit but only with a spectral density below a specified value. 
'^For brevity of notation, we denote as source (destination) q the source (destination) of link q. 

^We consider only white noise for simplicity, but the extension to colored noise is straightforward along well-known 
guidelines. 



Problem P.2 is motivated by the practical need of using discrete constellations, as opposed to Gaussian 
distributed symbols. 

Both problems P.l and P.2 are multi-objective optimization problems [11], as the (information/ 
transmission) rate achieved in each link constitutes a different single objective. Thus, in principle, the 
optimization of the transceivers requires a centralized computation (see, e.g., [121 [13] ^or a special case 
of problem P.l, with diagonal transmissions and no spectral mask constraints). This would entail a 
high complexity, a heavy signaling burden, and the need for coordination among the users. Conversely, 
our interest is focused on finding distributed algorithms to compute {Fq, Gq)^^^ with no centralized 
control. To achieve this goal, we formulate the system design within a game theory framework. More 
specifically, we cast both problems P.l and P.2 as strategic noncooperative (matrix- valued) games, 
where every link is a player that competes against the others by choosing its transceiver pair (Fq, Gq) 
to maximize its own objective (payoff) function. This converts the original multi-objective optimization 
problem into a set of mutually coupled competitive single-objective optimization problems (the mutual 
coupling is precisely what makes the problem hard to solve). Within this perspective, we thus adopt, 
as optimality criterion, the achievement of a Nash equilibrium, i.e., the users' strategy profile where 
every player is unilaterally optimum, in the sense that no player is willing to change its own strategy as 
this would cause a performance loss [Il]-[l6]. This criterion is certainly useful to devise decentralized 
coding strategies. However, the game theoretical formulation poses some fundamental questions: 1) 
Under which conditions does a NE exist and is unique? 2) What is the performance penalty resulting 
from the use of a decentralized strategy as opposed to the Pareto-optimal centralized approach? 3) 
How can the Nash equilibria be reached in a totally distributed way? 4) What can be said about 
the convergence conditions of distributed algorithms? In Part I of this two-part paper, we provide an 
answer to questions 1) and 2). The answer to questions 3) and 4) is given in Part II. 

Because of the inherently competitive nature of a multi-user system, it is not surprising that game 
theory has been already adopted to solve many problems in communications. Current works in the field 
can be divided in two large classes, according to the kind of games dealt with: scalar and vector power 
control games. In scalar games, each user has only one degree of freedom to optimize, typically the 
transmit power or rate, and the solution has been provided in a very elegant framework, exploiting the 
theory of the so called standard functions [l7]-[22]. The vector games are clearly more complicated, as 
each user has several degrees of freedom to optimize, like user codes or power allocation across frequency 
bins, and the approach based on the "standard" formulation of [I7j-[T9] is no longer valid. A vector 
power control game was proposed in [23] to maximize the information rates (under constraints on the 
transmit power) of two users in a DSL system, modeled as a frequency-selective Gaussian interference 
channel. The problem was extended to an arbitrary number of users in [24j-|28]. Vector power control 
problem in flat-fading Gaussian interference channels was addressed in [29] . 

The original contributions of this paper with respect to the current literature on vector games [23] - 
[29] are listed next. We consider two alternative matrix-valued games, whereas in |23]-[27]. |29] the 
authors studied a vector power control game which can be obtained from P.l as a special case, when the 
diagonal transmission is imposed a priori and there are no spectral mask constraints. Problem P.2, at 
the best of the authors' knowledge, is totally new. The matrix nature of the players' strategies and the 
presence of spectral mask constraints make the analysis of both games P.l and P.2 complicated and none 
of the results in [23]- [29] can be successfully applied. Our first contribution is to show that the solution 



set of both games is always nonempty and contains only pure (i.e., deterministic) strategies. More 
important, we prove that the diagonal transmission from each user through the channel eigenmodes 
(i.e., the frequency bins) is optimal, irrespective of the channel state, power budget, spectral mask 
constraints, and interference levels. This result yields a strong simplification of the original optimization, 
as it converts both complicated matrix-valued problems P.l and P.2 into a simpler unified vector power 
control game, with no performance penalty. Interestingly, such a simpler vector game contains, as 
a special case, the game studied in |23|-|27|. when the users are assumed to transmit with the same 
(transmit) power and no spectral mask constraints are imposed. The second important contribution of 
the paper is to provide sufficient conditions for the uniqueness of the NE of our vector power control 
game that have broader validity than those given in [23]- [27], [29] (without mask constraints) and, more 
recently, in [28] (including mask constraints). Our uniqueness condition, besides being valid in a broader 
context than those given in [23] -[29], exhibits also an interesting behavior not deducible from the cited 
papers: It is satisfied as soon as the interlink distance exceeds a critical value, almost irrespective of 
the channel frequency response. Finally, to assess the performance of the proposed game-theoretic 
approach, we compare the Nash equilibria of the game with the Pareto-optimal centralized solutions 
to the corresponding multi-objective optimization. We also show how to modify the original game in 
order to make the Nash equilibria of the modified game to coincide with the Pareto-optimal solutions. 
Not surprisingly, the Nash equilibria of the modified game can be reached at the price of a significant 
increase of signaling and coordination among the users. 

The paper is organized as follows. In Section[2l the optimization problems P.l and P.2 are formulated 
as strategic noncooperative games. Section [3] proves the optimality of the diagonal transmission and 
in Section |4] the conditions for the existence and uniqueness of the NE are derived. Section [5] gives a 
physical interpretation of the NE, with particular emphasis on the way each user allocates power across 
the available subchannels. Section [6] assesses the goodness of the NE by comparing the performance 
of the decentralized game-theoretic approach with the centralized Pareto-optimal solution. Numerical 
results are given in Section [71 Finally, in Section [HI the conclusions are drawn. Part of this work already 
appeared in [261 1271 [30l [3l] . 

2 System Model and Problem Formulation 

In this section we clarify the assumptions and constraints underlying the model ([2]) and we formulate 
the optimization problem addressed in this paper explicitly. 

2.1 System model 

Given the I/O system in ([5]), we make the following assumptions: 

A.l Neither user coordination nor interference cancellation is allowed; consequently encoding/decoding 
on each link is performed independently of the other links. Hence, the overall system in (|2]) is modeled 
as a vector Gaussian interference channel [M], where MUI is treated as additive colored noise; 
A. 2 Each channel is modeled as a FIR filter of maximum order Lh and it is assumed to change suffi- 
ciently slowly to be considered fixed during the whole transmission, so that the information theoretical 
results are meaningful; 

A. 3 In the case of frequency selective channels, with maximum channel order Lh, a cyclic prefix of 
length L > Lfi is incorporated on each transmitted block Xg in ([1|); 



A. 4 A (quasi-) block synchronization among the users is assumed, so that aU streams are parsed into 
blocks of equal length, having the same temporization, within an uncertainty at most equal to the 
cyclic prefix length; 

A. 5 The channel from each source to its own destination is known to the intended receiver, but not to 
the other terminals; an error-free estimate of MUX covariance matrix is supposed to be available at each 
receiver. Based on this information, each destination computes the optimal precoding matrix for its 
own link and transmits it back to its transmitter through a low (error-free) bit rate feedback channel|f| 

Assumption A.l is motivated by the need of finding solutions, possibly sub-optimal, but that 
can be obtained through simple distributed algorithms, that require no extra signaling among the 
users. This assumption is well motivated in many practical scenarios, where additional limitations 
such as decoder complexity, delay constraints, etc., may preclude the use of interference cancellation 
techniques. Assumption A. 3 entails a rate loss by a factor N/{N + L), but it facilitates symbol 
recovery. For practical systems, is sufhciently large with respect to L, so that the loss due to CP 
insertion is negligible. Observe that, thanks to the CP insertion, each matrix H^g in ([2]) resulting 
after having discarded the guard interval at the receiver, is a Toeplitz circulant matrix. Thus, H^g 
is diagonalized as Hrq = WD^gW^, with W G q^^^ denoting the normalized IFFT matrix, i.e., 
[W] = eJ2'^(*-i)(j-i)/^/^/iV for i, J = 1, . . . , A and D^g is a A x A diagonal matrix, where ['Drq]kk - 
Hrq{k)/ ^ (Uq is the frequency-response of the channel between source r and destination g, including the 
path-loss drq with exponent 7 and normalized fading Hrq{k), with drq denoting the distance between 
transmitter r and receiver q. 

The physical constraints required by the applications are: 
Co.l Maximum transmit power for each transmitter, i.e., 

^{l|x,|l2} = ^Tr(FgFf)<Pg, (4) 

where Pq is power in units of energy per transmitted symbol, and the symbols are assumed to be, without 
loss of generality (w.l.o.g.), zero-mean unit energy uncorrelated symbols, i.e., E |sg(n)s^(n)| = I. Note 
that different symbols may be drawn from different constellations. 
Co. 2 Spectral mask constraint, i.e., 

E{|[W^FgSg],|'} = [W^FgFfw]^^<p--(A:), VfcG {!,..., A}, (5) 

where p^^^{k) represents the maximum power user q is allowed to allocate on the k-th frequency bin. 
Constraints in ([5]) are imposed by radio spectrum regulations and attempt to limit the amounts of 
interference generated by each transmitter over some specified frequency bands. 
Co. 3 Maximum tolerable (uncoded) symbol error rate (SER) on each link, i.e.l^ 

Pe,qik) ^ Pmh{Sq{k) / Sq{k)} < Pig, \/k G {1, . . . , A}, (6) 



''in practice, both estimation and feedback are inevitably affected by errors. This scenario can be studied by extending 
our formulation to games with partial information [141 115] . but this goes beyond the scope of the present paper. 

■'Observe that if (1/iV) ^^p'^'^'^ik) < Pq, we obtain the trivial solution [W^F,Ff W]fcfc = Vfc. 

^ Given the symbol error probability P^, the Bit Error Rate (BER) Pi, can be approximately obtained from P^ (using 
a Gray encoding to map the bits into the constellation points) as Pb — Pc/ loggdC]), where logjdC]) is the number of bits 
per symbol, and \C\ is the constellation size. 



where Sg{k) is the A;-th entry of Sg given in ([3]). Another alternative approach to guarantee the required 
quality of service (QoS) of the system is to impose an upper bound constraint on the global average 
BER of each link, defined as (1/-/V) Y^k=i Pe,q{k)- Interestingly, in [35] it was proved that equal BER 
constraints on each subchannel as given in ([6]), provide essentially the same performance of those 
obtained imposing a global average BER constraint, as the average BER is strongly dominated by the 
minimum of the BERs on the individual subchannels. Thus, for the rest of the paper we consider BER 
constraints as in ([H]). 

2.2 Problem Formulation: Optimal Transceivers Design based on Game Theory 

In this section we formulate the design of the transceiver pairs (Fq,Gq)^^i of system ^ within the 



framework of game theory, using as optimality criterion the concept of NE |14j-|16j. We consider two 
classes of payoff functions, as detailed next. 

2.2.1 Competitive maximization of mutual information 

In this section we focus on the fundamental (theoretic) limits of system ([2]), under A.1-A.5, and 
consider the competitive maximization of information rate of each link, given constraints Co.l and 
Co. 2. Using A.l, the achievable information rate for user q is computed as the maximum mutual 
information between the transmitted block Xg and the received block y^, assuming the other received 
signals as additive (colored) noise. It is straightforward to see that a (pure or mixed strategy) NE 
is obtained if each user transmits using Gaussian signaling, with a proper precoder Fg. In fact, for 
each user, given that all other users use Gaussian codebooks, the codebook that maximizes mutual 
information is also Gaussian [H^. Hence, given A. 5, the mutual information for the q-th user is [3l] 

I,(F„F_,) = llog(|l + FfH5RliH,,F,|) (7) 

Q 

where R_g = cr^I + ^ HrgF^F^H^ is the interference plus noise covariance matrix, observed by 

r^q=l 

user q, and F_q = (Fj.)^_^^^-^ is the set of all the precoding matrices, except the g-th one. Observe that, 
for each link, we can always assume that the receiver is composed of an MMSE stage followed by some 



other stage, since the MMSE is capacity-lossless. Thus, w.l.o.g., we assume in the following thatl 

Gg = R:iH,gF,(I + FfHf^R:iHF,)-i, V<7 G {1, . . . , Q}. (8) 

Hence, the strategy of each player reduces to finding the optimal precoding Fg that maximizes 
lq(Fg,F-q) in ([7]), under constraints Co.l and Co. 2. Stated in mathematical terms, we have the 
following strategic noncooperative game 



maximize Ig(F„,F_g) 



subject to Fg G 



Vg G (9) 



q ^ q, 



where $7 = {1, . . . , Q} is the set of players (i.e., the links), lq{Fg, F_q) is the payoff function of player 
q, given in d?]), and ^g is the set of admissible strategies (the precoding matrices) of player q, defined 



®It is straightforward to verify that the MMSE receiver in ([8]) is capacity-lossless by checking that, for each g, the 
mutual information (for a given set of (Fq)^^^) after the equalizer Gq, log(|I + F^H^Gq(G^R_qG,)~^G^HgqFq|) is 
equal to 



as 

^,^|f,G C^x^:lTr(F,Ff)<P„ [W^F.Ff W] < p--(fc), Vfc = 1, . . . , ivj . (10) 
The solutions to ([9]) are the well-known Nash equilibria, which are formally defined as follows. 
Definition 1 A (pure) strategy profile F* = (F*)^^^^ e x . . . x is a NE of game if 

I,(F^,F%)>Ig(F„F%), VF,E^„VgGO. (11) 

The definition of NE as given in (jlip can be generalized to contain mixed strategies |14] . i.e., the 
possibility of choosing a randomization over a set of pure strategies (the randomizations of different 
players are independent). Hence, the mixed extension of the strategic game is given by = 
|o, {^g}ggf7, {Igj^g^l , where denotes the set of the probability distributions over the set of 
pure strategies. In game ^^i, the strategy profile, for each player q, is the probability density function 
fFq(Pq) defined on ^g and the payoff function Tg = Ejj,_^Ejp_ {l^} is the expectation of Ig defined 
in ([7]) taken over the mixed strategies of all the players. A mixed strategy NE of a strategic game is 
defined clS ct NE of its mixed extension [14j . 

Observe that for the payoff functions defined in d?]), we can indeed limit ourselves to adopt pure 
strategies w.l.o.g., as we did in ([9]). Too see why, consider the mixed extension Wi of in ([9]). For any 
player q, we have 

E/F,E/F_, {I, (F„F_,)} < Ej^_^ {ig (Ej,^ {FJ ,F_,)} , VF_, : /f_,(F_,) G (12) 

where ^-g = ^\ x . . . x ^ q-\ x x . . . x ^ q. The inequality in (fT2]) follows from the concavity of 

the function Iq(Fg,F_g) in F^F^ [33] and from Jensen's inequality [3l]. Since the equality is reached 
if and only if Fg reduces to a pure strategy (because of the strict concavity of Iq(Fg,F_g) in F^F^ 
whatever the strategies of the other players are, every NE of the game is achieved using pure strategies 

2.2.2 Competitive maximization of transmission rates 

The optimality criterion chosen in the previous section requires the use of ideal Gaussian codebooks 
with a proper covariance matrix. In practice, Gaussian codes are substituted with simple (suboptimal) 
finite order signal constellations, such as Quadrature Amplitude Modulation (QAM) or Pulse Amplitude 
Modulation (PAM), and practical (yet suboptimal) coding schemes. Hence, in this section, we focus on 
the more practical case where the information bits are mapped onto constellations of finite size (with 
possibly different cardinality), and consider the optimization of the transceivers (Fq,Gg)ygnj in order 
to maximize the transmission rate on each link, under constraints Co.l Co. 3. 

Given the signal model in ([2]), where now each vector Sg = (sg(A;))^]^ is drawn from a set of finite- 
constellations {Ck,g)^^i , i.e., Sq{k) G Cfc^g, the transmission rate of each link is simply the number of 
transmitted bits per symbol, i.e., n 

rg = ^log^{\Ck,g\), (13) 

k=l 

'^This result was obtained independently in [29j-[31]. 



where \Ck,q\ denotes the size of consteUation Ck^q- The (uncoded) average error probabihty of the q'-th 
link on the k-th substream, as defined in ([6]), under the Gaussian assumption, can be analytically 
expressed, for any given set (Fg,Gg)ggQ and {Ck,q)^^i, as 



p 

e,9 



(k) = ak,qQ (^/3,,,SINR,,,) , (14) 



where a^^g and Pi. ,^ are constants that depend on the signal constellation, Q (•) is the Q-function [36], 
and SINRfc^q is defined as 

2 

SINK,, 4 J^^^^!!!^, (15) 

with = HqqFqF^H^g — Hgqffc^qf^gH^ + R_q, whcrc fk,q denotes the k-th column of Fg, and 
R_, = all+ § H,,F,F^Hf, (see, e.g., fHW- 

r^q=l 

According to the constraints Co. 3 in because of p4|) . the optimal linear receiver for each user 
q can be computed as the matrix Gq maximizing simultaneously all the (SINRfc^^)^^ in (jlSp . while 
keeping the set of precoding matrices (Fq)^^^^ and the constellations {Ck,q)^^i ^gf^fixed. This leads to 
the well-known Wiener filter for G^, as given in ([8]) 0181 [9], and the following expression for the SINRs 
in (fT5l) : 

SINRfco = SINRfc„(F„,F_„) = ^ , 1, A; G {1, . . . , iV}. (16) 

''^ ' [(I + FfH^^,RI^H,,F,)-i]^^ t ' ' / y ) 

Under the previous setup, each player has to choose the precoder F^ and the constellations (C^^g)^^ 
that maximize the transmission rate in (jl3p . under constraints Co.l Co. 3. Since, for any given rate, 
the optimal combination of the constellations {Ck,q)^^i would require an exhaustive search over all the 



, the classical method to 
As a result, the number 



combinations that provide the desired rate, in the following we adopt, as in [9 
choose quasi-optimal combinations, based on the gap approximation |37l I38j 1^ 
of bits that can be transmitted over the substreams from the g-th source, for a given family of 
constellations and a given error probability P*q, is approximatively given by 

r,(F,.F-,)4E.o..(l.5EJME^) 

where SINRfc^g(Fg, F_g) is defined in (fT6l) . and > 1 is the gap which depends only on the constella- 
tions and on P*q. For M-QAM constellations, e.g., if the error probability in (jl4p is approximated by 
Pe,q{k) ~ 4Q (v/(3/(M-l))SINRfc,g) , the resulting gap is F, = (Q-H^'e,g/4))V3 [Qj. 
In summary, the structure of the game is 

maximize r„(F„,F_„) 

VgGO, (18) 

subject to Fq e ^q, 

where and rg(Fg, F_q) are defined in ([TO]) and p!7|) . respectively. As in Q, in the following we focus 
on pure strategies only. 



*In our optimization we will use, as optimal solution, the continuous bit distribution obtained by the gap approximation, 
without considering the effect on the optimality of the granularity and the bit cap. The performance loss induced by these 
sources of distortion can be quantified using the approach given in |^. 



3 Optimality of the Channel-Diagonalizing Structure 



We derive now the optimal set of precoding matrices (Fq)q(z^ for both games and J^2, and provide a 
unified reformulation of the original complicated games in a simpler equivalent form. The main result 
is summarized in the following theorem. 

Theorem 1 An optimal solution to the matrix-valued games and ^2 is 



Fq = W^diag(p,), yq G Q, (19) 

where W is the IFFT matrix, and p = (pq)<jerj, with p^ = {pq{k))^^^, is the solution to the vector-valued 
game defined as 

maximize Ra(pn,P-a) 
(^) : P« , Vg G f^, (20) 

subject to Pq G 

where Rq{pq,p-q) and ^q are the payoff function and the set of admissible strategies of user q, respec- 
tively, defined as 

Rq{Pq,P-q) = Yl^Og i^l + — S\mqik)j , (21) 



and 



Lqe R^:^ J]p,(A;)<l, < Pq{k) < p^'^^ik), VA; G {1, . . . , iV} 

I k=l 



(22) 



with p^^^'ik) = p'^^''{k)/Pq, 



3,^^ .j^. ^ Pq\Hqq{k)\'pq{k)/cPqq ^ \Hqq{kfpq{k) 

' ^l + Y.r^qPr\Hrq{k)\^Pr{k)/dJq 1 + E r^, I ^rg(fc) | ' p. (/c) ' 



where H^qik) — Hrq{k)y Pj-/ [cr"^ drq) , and Tq = 1 if is considered. 

Proof. See Appendix E ■ 

Remark 1 — Optimality of the diagonal transmission. According to Theorem [H a NE of both 
games §fi and ^2 is reached using, for each user, a diagonal transmission strategy through the channel 
eigenmodes (i.e., the frequency bins), irrespective of the channel realizations, power budget, spectral 
mask constraints and MUX. This result simplifies the original matrix-valued optimization problems ([9]) 
and (jlSp . as the number of unknowns for each user reduces from A'"^ (the original matrix Fg) to N (the 
power allocation vector = (Pg(fc))^i, with no performance loss. 

Observe that the optimality of the diagonalizing structure was well known in the single-user case, 
when the optimization criterion is the maximization of mutual information and the constraint is the 
average transmit power [Z]-[9], [35]. However, under the additional constraint on the spectral emission 
masks, the optimality of the diagonal transmission has never been proved, neither in a single-user nor 
in a multi-user competitive scenario. But, most interestingly, Theorem [1] proves the optimality of the 
diagonal transmission also for game 5^2; where each player maximizes the transmission rate, using finite 
order constellations, and under constraints on the spectral emission mask, transmit power, and average 
error probability. In such a case, the optimality of the channel-diagonalizing scheme was not at all 



clear. Previous works on this subject adopted the typical approach used in single-user MIMO systems 
[23]- [27]: They first imposed the diagonal transmission and then employed the gap approximation 
solution over the set of parallel subchannels. However, such a combination of channel diagonalization 
and gap approximation was not proved to be optimal. Conversely, Theorem [1] proves the optimality of 
this approach and it subsumes, as particular cases, the results of [23]-[27], corresponding to the simple 
case where there are no mask constraints. 

It is also worth noticing that the optimality of the diagonalizing structure is a consequence of the 
property that all channel matrices, under assumptions A. 2 and A. 3, are diagonalized by the same 
matrix, i.e., the IFFT matrix W. There is another interesting scenario where this property holds true: 
The case where all the channels are time-varying flat fading and the constraints are on the transmit 
power and on the maximum power that can be emitted over some specified time intervals (this is 
the dual version of the spectral mask constraint). In such a case, all channel matrices are diagonal 
and then it is trivial to see that they have a common diagonalizing matrix, i.e., the identity matrix. 
Applying duality arguments to Theorem 1, the optimal transmission strategy for each user is a sort 
of TDMA over a frame of time slots, where each user optimizes the power allocation across the 
time slots (possibly sharing time slots with the other users). Clearly, as opposed to the case considered 
in Theorem 1, in the time-selective case, the transmitter needs to have a non-causal knowledge of the 
channel variation. In practice, this kind of knowledge would require some sort of channel prediction. 

According to Theorem (H instead of considering the matrix- valued games and ; we may focus 
on the simpler vector game S^, with no performance loss. It is straightforward to see that a NE of both 
matrix-valued games exists if the solution set of ^ is non empty. Moreover, the Nash equilibria of , 
if they exist, must satisfy the waterfilling solution for each user, i.e., the following system of nonlinear 
equations: 

p* = WF,(pt,...,p^_i,pJ+i,...,p*) =WF,(p%) , yqGn, (24) 
with the waterfilling operator WFg (•) defined as 



[WF,(p-,)],^ 



kG{l,...,N}, (25) 



where [x]^ denotes the Euclidean projection of x onto the interval [a, and the water-level //g is chosen 
to satisfy the power constraint (l/N) Ylk=iPqik) = 1- 

Given the nonlinear system of equations (|24p . the fundamental questions are: i) Does a solution 
exist? ii) If a solution exists, is it unique? iii) How can such a solution be reached in a distributed way? 

The answer to the first two questions is given in the forthcoming sections, whereas the study of 
distributed algorithms is addressed in Part II of this paper |32] . 



4 Existence and Uniqueness of NE 

Before providing the conditions for the uniqueness of the NE of game ^, we introduce the following 
intermediate definitions. Given game define H(A;) € M*3xQ a,s 

®The Euclidean projection [x]''^ is defined as follows: [a;]^ = a, 'd x < a, [x]''^ = x, ii a < x < b, and [x]''^ — b, ii x > b. 



iqr 



\H,,{k)\^d'^^P,' ^ ■ ■ - (26) 

0, otherwise, 

where Dq denotes the set {1, . . . , A^} (possibly) deprived of the carrier indices that user q would never 
use as the best response set to any strategy used by the other users, for the given set of transmit power 
and propagation channels: 

^ {A: G {1, . . . , iV} : 3 p_, G such that [WF, (p_,)]^ / O} , (27) 

with WFg (•) defined in ([25]) and = x • • • x x ^g+i x • • • x ^q. 

The study of game ^ is addressed in the following theorem. 

Theorem 2 Game ^ admits a nonempty solution set for any set of channels, spectral mask constraints 
and transmit power of the users. Furthermore, the NE is unique if 

/>(H(A:))<1, VA;G {!,..., Af}, (CI) 

where H(fc) is defined in [2^) and p(H{k)) denotes the spectral radiul^ o/H(/c). 

Proof. See Appendix |B1 ■ 
We provide now alternative sufficient conditions for Theorem [2l To this end, we first introduce the 
matrix H"^^^ gRQ^'Q, defined as 



r-rj-maxi A 

\qr = 



n max —= — , , ,^ —r- — -, ii t ^ q, 

« k<^v,nvr \Hqq{k)\'^ d»qPg' ^ ^' (28) 

0, otherwise. 



with the convention that the maximum in (j28p is zero if 'DqCi'Dr is empty. Then, we have the following 
corollary of Theorem [2j 



Corollary 1 A sufficient condition for ( fCij) is: 

y9(H™^^)<l, (C2) 



where H™**^ is defined in [28]) . 

To give additional insight into the physical interpretation of the conditions for the uniqueness of 
the NE, we introduce the following corollary. 

Corollary 2 A sufficient condition for iCl]) is given by one of the two following set of conditions: 

^ Wr[n{k)]g^<l, yq£n, and Wk £ {1, . . . , N}, (C3) 

^ r=l,r^q 



— TgWg[lI{k)lg<l, W€n, and VA; G {1, . . . , A^}, (C4) 

q=l,qj^r 

where w = [wi, . . . ,wq]'^ is any positive vector and H(A;) is defined in h2&\) . 



^"The spectral radius p (A) of the matrix A is defined as p(A) = max{|A| : A G a(A)}, with (t(A) denoting the 
spectrum of A (50j . 



Note that, as a by-product of the proof of Theorem [21 one can always choose T>q = {1, . . . , A^} in 
(|Cl|) -( iC4l) . i.e., without excluding any subcarrier. However, less stringent conditions are obtained by 
removing the unnecessary carriers, i.e., those carriers that, for a given power budget and interference 
levels, are never going to be used. 

Remark 2 — Physical interpretation of uniqueness conditions. As expected, the uniqueness of 
NE is ensured if the links are sufficiently far apart from each other. In fact, from ()C3P - ()C4p for example, 
one infers that there exists a minimum distance beyond which the uniqueness of NE is guaranteed, 
corresponding to the maximum level of interference that may be tolerated by the users. Specifically, 
condition ()C3P imposes a constraint on the maximum amount of interference that each receiver can 
tolerate; whereas ()C4p introduces an upper bound on the maximum level of interference that each 
transmitter is allowed to generate. This result agrees with the fact that, as the MUX becomes negligible, 
the rates Rq in (|21|) become decoupled and then the rate-maximization problem in (|2U|) for each user 
admits a unique solution. But, the most interesting result coming from conditions ()Cip - ()C4p is that 



the uniqueness of the equilibrium is robust against the worst normalized channels \Hr-q{k)\'^ / \Hqq{k)\^] 
in fact, the subchannels corresponding to the highest ratios \Hrq{k)\'^ /\Hqq{k)\^ (and, in particular, the 
subchannels where \Hqq{k)\'^ is vanishing) do not necessarily affect the uniqueness condition, as their 
carrier indices may not belong to the set Vq. 

Remark 3 — Uniqueness condition and distributed algorithms. Interestingly, condition ()C2p . 
in addition to guarantee the uniqueness of the NE, is also responsible for the convergence of both 
simultaneous and sequential iterative waterfilling algorithms, proposed in Part II of the paper 



Remark 4 — Comparison with previous conditions. Theorem [2] unifies and generalizes many 
existence and uniqueness results obtained in the literature [23]-[27], [29] for the special cases of game §f 
in ([2m) . Specifically, in [23]- [27] a game as in ([20]) is studied, where all the players are assumed to have 
the same power budget and no spectral mask constraints are considered [i.e., p^^^{k) = -|-cxd, V/c, g]. In 
[29| instead, the channel is assumed to be fiat over the whole bandwidth. Interestingly, the conditions 



obtained iii | 23 | -[27]. [29] are more restrictive than (|Cip - (|C4p . as shown in the following corollary of 
Theorem [2Lll 



Corollary 3 Sufficient conditions for (C^) are JM, 



\Hrq{k)\'^^ dqq Pr ^ 1 



Tq max <^ fv , o } T^TT < 7^ 7> V r, g / r € (C5) 

''ke{l,...,N}\\Hqq{k)\^ j d<^qPq Q-l' '^^ ^ ^ 



or fl4 

Tq max i~7rrPj\^lT < ^ > V r, g / r G fi. (C6) 



\Hrq{k)\^ \ d-qP, 1 
" fcefCiV} \\Hqq{k)\^ i d<^q Pq 2Q - 3 ' 

In the case of flat-fading channels (i.e., Hrq{k) = Hrq, "ir^q), condition WS\) becomes J2§\ 



'^We summarize the main results of [23)- [27) using our notation to facilitate the comparison. 
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(a) (b) 
Figure 1: Probability of (CI), (C5) and (C7) versus drq/dgq-, (5 = 5 [subplot (a)], Q = 15 [subplot (b)], 7 = 2.5, 

r, = 1, drq/dqq = dqr/drr, Vr, (J £ fi, TV = 64. 



ilecently, alternative sufficient conditions for the uniqueness of the NE of game 
^ Among all, an easy condition to be checked is the following: 

I + H(A:) is positive definite for all A; € {1, ... , A''}, 



were given m 



(C7) 



where H(fc) is defined as in ([^Sj) . with each Vq = {1, . . . , N}. 

All the conditions above depend on the channel realizations {-ffrij(fc)} and on the network topology 
through the distances {drq} ■ Hence, there is a nonzero probability that they are not satisfied for a given 
set of channel realizations, drawn from a given probability space. In order to compare the goodness 
of the above conditions, we tested them over a set of channel impulse responses generated as vectors 
composed of i.i.d. complex Gaussian random variables with zero mean and unit variance. We plot 
in Figured] the probability that conditions ()Cip . ()C5p and (|C7p are satisfied versus the ratio drq/dgq, 
i.e., the normalized interlink distance. For the sake of simplicity, we assumed drq = dqr, Pq = Pr and 
Tq = 1, Vg,r G 17. We considered Q = 5 [Figure [D^a)] and Q = 15 [Figure [Hb)] active links. We tested 



our condition considering in (jCip a set T>q obtained using the following worst case scenario. For each 
user g, we build the worst possible interferer as the virtual node (denoted by v) that has a power budget 
equal to the sum of the transmit powers of all the other users (i.e., X^r^g channel between its 

own transmitter and receiver q as the highest channel among all the interference channels with respect 
to receiver q, i.e., \H^q{k)\'^ = max^^g \Hrq{k)\^. We build a set that includes the set Vq defined in (p7|) 
using the following iterative procedure: For each subcarrier k, the virtual user distributes its own power 
(Sr^<j ) across the whole spectrum in order to facilitate user q to use the subcarrier k, as much as 
possible. If, even under these circumstances, user q is not going to use subcarrier k, because of its own 
power budget Pq and \Hqq{k)\'^ , then we are sure that index k can not possibly belong to Vq. 

We can see, from Figure [H that the probability that the NE is unique increases as the links become 
more and more separated of each other (i.e., the ratio drq/dqq increases). Furthermore, we can verify 
that, even having not considered the smallest possible set Vq, as defined in ([27|) . our condition (|Cip 
has a much higher probability of being satisfied than (jCSP and ()C7p . The main difference between 
our condition (jCip and (jCSp . (IC7p is that (jCip exhibits a neat threshold behavior since it transits 



We thank Prof. Facchinei, who kindly brought to our attention reference [2^, after this paper was completed. 



very rapidly from the non- uniqueness guarantee to the almost certain uniqueness, as the inter- user 
distance ratio drq/dqq increases by a small amount. This shows that the uniqueness condition (jCip 
depends, ultimately, on the interlink distance rather than on the channel realization. This represents 
the fundamental difference between our uniqueness condition and those given in the literature. As an 
example, for a system with Q = f> links and probability 0.99 of guaranteeing the uniqueness of the 
NE, condition ()Cip requires drq/dqq ~ 2 whereas conditions ()C5P and \C7\ require drq/dqq > 40 and 
drq/dqq ~ 24, respectively. Furthermore, this difference increases as the number Q of links increases. 

5 Physical Interpretation of NE 

In this section we provide a physical interpretation of the optimal power allocation corresponding 
to the NE in the limiting cases of low and high MUI0 To quantify what low and high interference 
mean, we introduce the SNR of link q (denoted by snr^) and the Interference-to-Noise Ratio due to the 
interference received by destination q and generated by source r, with r ^ q (denoted by inr^g), defined 
as snrg = Pq/ {a'^d'/^^) and inr^g = Pr / {Ugd'^g) . Using snrg and inr^g, the SINK sinrg(A;) in (p3|) can be 
rewritten as 



snrg 


\Hqqik)f Pq{k) 


'^ + J2r^q 


inr^g 


\Hrqik)\ 





(30) 



Low interference case. Consider the low interference case, i.e., the situation where the interference 
term in the denominator in (j30p can be neglected. A sufficient condition to satisfy this assumption is 
that the links are sufficiently far apart from each other, i.e., drq >> dqq Vr ^ q,r, q £ 0,. For sufficiently 
small inr^g and sufficiently large snrg, condition (jCip is satisfied and, hence, by Theorem 1, the NE is 
unique. Also, by inspection of the waterfilling solution in (|24p. it is clear that under those conditions, 
Pq{k) > for all (7 G and k £ {1, . . . , N}. This means that each source uses the whole bandwidth. 
Furthermore, it is well known that as the SNR increases, the waterfilling solution tends to a flat power 
allocation. In summary, we have the following result. 

Proposition 1 Given game there exist sets of values {inr*g},.^ggf2 and {snr*}q^Q, with inr*^ << 1 
and snr* >> 1, such that, for all inr^g < inr*^ and snrg > snr*, the NE of^ is unique (cf. Theorem{^ 
and all users share the whole available bandwidth. In addition, i/snrg >> snr*, then the optimal power 
allocation of each user tends to be flat over the whole bandwidth. 

From Proposition[Tl it turns out, as it could have been intuitively guessed, that when the interference 
is low, at the (unique) NE, every user transmits over the entire available spectrum (like a CDMA 
system), as in such a case nulling the interference would not be worth of the bandwidth reduction. 
As a numerical example, in Figure 2, we plot the optimal power spectral density (PSD) of a system 
composed of three links, for different values of the ratio drq/dqq. The results shown in Figure 2 have 
been obtained using the distributed algorithms described in Part II [32j. From Figure 2, we can check 
that, as the ratio drq/dqq increases, the optimal PSD tends to be flat, while satisfying the simultaneous 
waterfilling condition in (j24p . 

^■^For the sake of notation, in this section we consider only the case in which p]^^^{k) = +oo, \/q,\/k, but it is straight- 
forward to see that our derivations can be easily generalized to the case of spectral mask constraints. 



High interference case. When inr^-g >> 1 for all q and r ^ q, the interference is the dominant 
contribution in the equivalent noise (thermal noise plus MUI) in the denominator of (I30p . In this case, 
game ^ admits multiple Nash equilibria. An interesting class of these equilibria includes the FDMA 
solutions (called orthogonal Nash equilibria), occurring when the power spectra of different users are 
nonoverlapping. The characterization of these equilibria is given in the following. 

Proposition 2 Given game 1^, for each q £ Q,, let Tq = {k ^ {1, . . . , A^} : Pq{k) > 0, Pr{k) = 0, Vr 
^ q ^Vl} denote the set of subcarriers over which only user q transmits. For any given {snrglggQ, 
there exists {inr*^}r^ggf2, with each inr*^ >> 1, such that for all inr^g > inr*^, game ^ admits multiple 
orthogonal Nash equilibria. If, in addition, {snr^lggf^ and {inr^glr^ggn ore such that 

j^^inv^ kelq,r^qen, (31) 

' \Hqqikf Sr^rq 

and an orthogonal NE still exists, the subcarriers are allocated among the users according to 

^ ~ '"'^^ > g 2r and kg G Zq. (32) 



Hqq{kr)\ \Hqqik, 



Proof. See Appendix\^ ■ 

The above proposition has an intuitive interpretation: When the interference is very high, the users 
self-organize themselves in order to remove the interference totally, i.e., using nonoverlapping bands. 
In this case, game $f may have multiple orthogonal Nash equilibria. For example, in the simple case of 
Q = N, there are Q\ different orthogonal Nash equilibria, corresponding to all the permutations where 
each transmitter uses only one carrier. As the interference level decreases (i.e., the inr^g's), the NE 
becomes unique and in such a case, if an orthogonal equilibrium still exists, then the distribution of the 
subcarriers among the users must satisfy the rule given by ()32p . This strategy is similar, in principle 
to FDMA, but differently from standard FDMA, here each user is getting the "best" portion of the 
spectrum for itself. Interestingly, (|32p is the generalization of the condition satisfied by the subcarrier 
allocation in the multiple access frequency-selective channel, where the optimization problem is the 



sum-rate maximization under a transmit power constraint p9]. In Figure 2(b), we show a numerical 



example of the optimal power allocation at NE, for a system with two active links, in the case of high 



interference. From Figure 2(b) we observe that, as predicted by Proposition [21 different users tend to 
transmit over non-overlapping bands. 

In general, with intermediate interference levels, the optimal solution consists in allowing partial 
superposition of the PSDs of the users. An example of optimal PSD distribution for an intermediate 
level of interference is plotted in Fig ure |2(a)| ( see the curves obtained with drq/dqq = 5). 

6 How good is a Nash Equilibrium ? 

The optimality criterion used in this paper, based on the achievement of a NE, is useful to derive 
decentralized strategies. However, as the Nash equilibria need not to be Pareto-efficient [4^, this 
criterion, in general, does not provide any a priori guarantee about the goodness of the equilibrium. 
Even when the equilibrium is unique, it is important to know how far is the performance from the 
optimal solution provided by a centralized strategy [12l [26]. The scope of this section is thus to 
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(a) Optimal PSDs at NE in the low-interference case. 
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(b) Optimal PSDs at NE in the high-interference case. 



Figure 2: Optimal PSDs at NE in the low-interference and high-interference cases: a) Solid, dashed, and dashed-dot 
line curves refer on the PSD obtained for drq/dqq = 5,8,12, respectively, and snr, — 15 dB; b) Solid and dashed lines refer 
to PSD of each link and PSD of the MUI plus thermal noise, normalized by the channel transfer function square modulus 
of the link, respectively; di2/rf22 ~ d2i/dii = l,ri = r2 = 1, snri — snr2 = 5 dB. 

quantify the performance loss resulting from the use of a decentralized approach, with respect to the 
optimal centralized solution. To this end, we compare the rates of the users corresponding to the Nash 
equilibria of game in (j20p with the rates lying on the boundary of the largest region of the achievable 
rates obtained as the Pareto-optimal trade-off surface solving the multi-objective optimization based 
on pTD . 

The rate region is defined as the set TZ of all feasible data rate combinations r = [ri, . . . , tq]'^ £ M.'^ 
among the active links. A given vector of rates r G is said to be feasible if it is possible to transfer 
information in the network at these rates, reliably, for some power allocation p = {pg)qen satisfying 
the power constraints Pq £ \/q G Vt. The rate region can be numerically computed by considering 
all possible power allocations p such that Pq G l^q, G VL. Specifically, we have 

7^= (J {rGMj I <i?5(p),VgGJ7} = {rGM*^ I 3p:pgG <i?,(p), VgGJ^} 

Pq&!^q, gen 

(33) 

where Rq{p) and 3^q are given in (f2TI) and (f22l) . respectively. 

In general, the characterization of the rate region is a very difficult nonconvex optimization problem. 
Nevertheless, in the case of low interference, we have the following. 

Proposition 3 In the case of low interference (inr^g << 1), and high SNR {snvq » 1), the rate region 
IZ given in \33^) is a convex set. 

Proof. See Appendix [Dj ■ 
The best achievable trade-off among the rates is given by the Pareto optimal points of the set TZ 
given in ()33p . In formulas, all highest feasible rates can be found by solving the following multi-objective 
optimization problem (MOP) pT] : 

maximize {i?i(p), . . . , -Rq(p)} 
p 

subject to Pq G Vg G il. 

Practical algorithms to solve the MOP in (j34p can be obtained, using the approach proposed, e.g., 
in |131 142j . The Pareto optimal solutions to the MOP can also be obtained by solving the following 



(34) 



ordinary scalar optimization: 

Q 

maximize E ^g^g(p) 

P q=l (35) 

subject to Pq G Vg £ fi. 

where A is a set of positive weights. For any given A > 0, the (globally) optimal solution to (j35p is a 
point on the trade-off surface of MOP (j34p (cf. Appendix |Ej) . However, as the rate region (j2ip is in 
principle nonconvex, by varying A > 0, only a portion of the trade-off curve of (j34p can be explored. 
Specifically, all the Pareto optimal points lying on the nonconvex part of the rate region cannot be 
obtained as solutions to (|35p . We will refer to (j35p as the scalarized MOP. 

Comparing (I20p with (|34p . we infer that, in general, the Nash equilibria are not solutions to (134p . 
and thus they are not Pareto-efficient. An interesting question is whether one can modify the payoff 
function of every player so that some Nash equilibria of the modified game coincide with the Pareto 
optimal solutions. The answer is given by the following proposition. 

Proposition 4 Let ^ (A) he the game defined as 

#(A) = {l],{^J^^^,{5jgen}, (36) 

where the payoff functions are 

%{p) = Rq{p) + ^ V KRrip) (37) 

with Rq{p) defined in I121\) and A = [Ai, . . . , Aq]"^ is a set of positive weights. Then, for any given 
A > 0, the solution set of ?^(A) is not empty and contains the (globally) optimal solution to the 
scalarized MOP 135\) . which is the Pareto optimal solution to the MOP |g^[ ) corresponding to the point 
where the hyperplane with normal vector A is tangent to the boundary of the rate region i33\) . 
Moreover, if the conditions of Proposition are satisfied, then: 

1. ^(A) admits a unique NE, for any given A > 0, and 

2. An}0 Pareto optimal solution to the MOP {34^ can be obtained as the unique NE of^iX), with 
a proper choice of \ > 0. 

Proof. See Appendix |El ■ 
Comparing (j2ip with (j37p . an interesting interpretation arises: The Pareto-optimal solutions to 
the MOP (j34p can be achieved as a NE of the modified game where each player incorporates, in 
its utility function, a linear combination, through positive coefficient, of the utilities of the other 
playersO The NE of the modified game ?^(A) in (|36p can be obtained, for any given A, using, e.g., 
the gradient projection based iterative algorithm, proposed in [16j. However, this algorithm requires 
coordination among the players, since, at each iteration, each user has to know the channels and the 
strategies adopted by all the other users. Thus, Pareto-efficiency can be achieved only at the price 



^*We do not consider the rate profiles where the rate of some user is zero, w.l.o.g.. The corner points of the rate region 
can be achieved solving a lower dimensional problem. 

^® An alternative approach to move toward Pareto optimality is to allow that the game could be played more than once, 
i.e., to consider the so called repeated games [M], with a proper punishment strategy [41]. This study goes beyond the 
scope of this paper. 



of a significant increase of signalling and coordination among the users and this goes against our 
search for distributed, independent, coding/decoding among the users. Note that the structure of (f37|l 
generalizes the pricing techniques widely used in the game theory field to obtain a Pareto improvement 
of the system performance with respect to the Nash equilibria of a noncooperative game (see, e.g., |43j 
and references therein). 

Using Proposition SI we can now characterize and quantify the Nash equilibria of the game given 
in (pjj) by providing upper and lower bounds. 

Proposition 5 All the Nash equilibria p* of game ^ in 120\) satisfy the following inequality 

max min Rg{pg,p^g) < Rg{p* p* ) <^l, G (38) 

where is a NE of'^{X) defined in \30\l with a proper choice of X. 

Proof. See Appendix |Fl ■ 
Thus, at any NE of game 1^ in (j20p . the rate of each user is always better than that obtained by 
optimizing the worst case (which in general is too pessimistic). However, in general, the Nash equilibria 
of ^ may be Pareto dominated. We quantify numerically this loss in the next section. 



7 Numerical Results 

It is interesting to compare the Nash equilibria of game 1^ with the rates lying on the boundary of the 
rate region, in order to quantify the loss of Nash equilibria with respect to the Pareto optimal solutions. 
To this end, we consider the following two different topologies. In the first example, we assume that the 
system operates in a symmetric situation, whereas in the second example we consider an asymmetric 
scenario. 

Example 1: Symmetric Case. In this scenario, the system is assumed to be symmetric, i.e., the 
transmitters have the same power budget and the interference links are at the same distance (i.e., 
drq = dqr, yq,r), SO that the cross channel gains are comparable in average sense. In Figure [3l we plot 
the Pareto optimal points of (j34p and the Nash equilibria of game ^ defined in (j20p . for a two-users 
symmetric system. The two axes represent the rates, in bits/symbol, for the two links. The two pairs 
of nodes are placed at different distances, to test situations with different level of interference. In the 
picture, we plot: i) the Pareto-optimal boundary (133]) . referred to as TZmop (solid lines); ii) the NE 
points (*) of game given in ()20p : iii) the NE points of the modified game ^(A), given in (I36p . for 
different values of the vector A (squares); and iv) the rate region (referred as TZne) corresponding to 
the Nash equilibria achieved by varying the transmit power of each link, under the constraint that the 
overall transmit power is fixed (dashed lines) All the Nash equilibria of game are reached using 
the algorithms introduced in Part II of this paper [32], whereas the Nash equilibria of game §^(A) are 
reached using the gradient projection algorithm, proposed in [16j. From Figure[3l we infer that the Nash 
equilibria approach the optimal Pareto curve as the interference level decreases (i.e., the ratio drq/dqq 

^^Note that the comparison between dashed and solid Unes is not totally fair because all the rates on the boundary 
of TZmop are achieved with the same power constraint Pq for each transmitter, whereas the NEs reported in the dashed 
lines are obtained assuming only a total power constraint. 



increases) at least in the two user case. This is not surprising as, in the case in which the interference 
is sufficiently low, the interaction (interference) among users becomes negligible and the performance is 
limited by noise only, not by the interference. On the contrary, at small interpair distances (i.e., small 
ratios drq/dqq), interference becomes the dominant performance limiting factor and the loss resulting 
from using the decentralized approach becomes progressively larger. But the most interesting result 
is that this loss is limited also in the case where the links are rather close to each other (we have 
observed this result for several independent symmetric channel realizations). This suggests that, for 
symmetric systems, the decentralized approach, based on a game-theoretic formulation, is indeed a 
viable choice, considering its greater simplicity with respect to the centralized optimal solution. From 
Figure (Sj we also see that the solutions to the MOP can be alternatively reached as Nash equilibria of 
the modified game ^ in ()36p (Proposition S]), using the gradient projection algorithm of [16]. However, 
this algorithm cannot be implemented in a distributed way, as it requires the knowledge from each user 
of the channels and the power allocations of all the other links. 
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Figure 3: Rate region achieved for di2/d22 = ^12/^22 = 2, 3, 5, du — ^22 — 1, Pi = P2, snri = snr2 = 10 dB. Solid line, 
dashed line curves and asterisks refer on rate region given in (|33[l . rate region obtained by NEs of S^(A) given in (|36p. and 
NE points of game given in (|20[l . respectively. 



Example 2: Asymmetric Case. We consider now a two-users system operating in an asymmetric 
situation, where one link receives a large interference whereas the other does not. This asymmetry can 
be due to many reasons, such as different transmission powers and/or unbalanced cross gains among 
the users (i.e., di2 « ^21 or di2 >> d2i)- For the sake of brevity, in the following we consider only 
the latter case, i.e., the situation in which both transmitters have the same power budget but, because 
of the location of transmitters and receivers, one link receives much more interference than the other. 



In Figure 4(a) we plot the Pareto optimal points of (I34p and the Nash equilibria of game defined in 
(|20p . for a two-users asymmetric system, for different values of the ratio ^12/^21 and a given channel 
realization. Low values of ^12/^^21 correspond to high asymmetric situations. From the figure we infer 
that as the asymmetry of the system increases (i.e., ^12/^21 decreases) the loss of Nash equilibria with 
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Figure 4: a) Rate region achieved in the asymmetric case, for di2/d2i ~ 0.2,0.5,0.8, du — ^22 = 1, Pi = P2, 

snri = snr2 — 5 dB. Sohd hne and dashed hne curves refer on rate region given in (|33|l and rate region obtained by NEs 

of given in (|20p . respectively; b) Rate region achieved for L = 0, 4, 8, N = 64, ^12/^22 = rf2i/dii ~ 1, rfii ~ ^22 ~ 1, 

Pi — P2, snri — snr2 — 5 dB. 

respect to the corresponding Pareto optimal points becomes more significant. As an example, for the 
setup considered in the figure, the performance loss in terms of sum-rate can be as large as 30% of 
the globally optimal solution. The same qualitative behavior has been observed changing the channel 
realizations and the number of users. 

Example 3: Rate region versus channel order. Finally, we study the effect of channel frequency 
selectivity on the Nash equilibria of game in (|20p . To this end, we plot in Figure [4(b)| the average 
rate region obtained by the Nash equilibria of for different values of the channel order Lf^ . The channel 
taps are simulated as i.i.d. Gaussian random variables with zero mean and variance l/{Lh + 1). From 
the figure, we infer that the performance of decentralized system improves with the channel order. This 
is due to the fact that larger frequency fluctuations of the channel provide more degrees of freedom for 
each user to find out the best spectral partition for itself, given its own channel and interference. 

8 Conclusions 

In this paper we formulated the problem of finding the optimal precoding/multiplexing strategy in 
an infrastructureless multiuser scenario as a noncooperative game. We first considered the theoretical 
problem of maximizing mutual information on each link, given constraints on the spectral mask and 
transmit power. Then, to accommodate practical implementation aspects, we focused on the compet- 
itive maximization of the transmission rate on each link, using finite order constellations, under the 
same constraints as above plus a constraint on the average error probability. We proved that in both 
cases a NE always exists and the optimal precoding/multiplexing strategy leads to a (pure strategy) 
diagonal transmission for all the users. This result strongly simplifies the optimization, as we can 
reduce both original complicated matrix-valued games to a simpler unified vector power control game. 
Thus, we studied such a game, and derived sufficient conditions for the uniqueness of the NE, that 
were proved to have a broader validity than the conditions known in the literature for special cases of 
our game. Then, we compared the totally decentralized strategy with the Pareto-optimal centralized 
solution and we observed that the decentralized strategy has rather low performance loss with respect 
to the Pareto-optimal solution, especially for symmetric systems. Larger losses were observed in the 
case of very asymmetric systems. In the effort to approach the Pareto-optimal performance, we then 



showed how to modify the payoff of each user in order to create a modified game, whose Nash equihbria 
are Pareto-optimal. However, this comes at the cost of extra signahng among the users and breaks the 
noncooperative feature of the original games. 

Once proved that a Nash equihbrium exists and under which conditions is unique, the problem boils 
down to how to reach such an equilibrium. This problem is addressed in Part II of this paper |32|, 
where we provide a variety of distributed algorithms along with their convergence conditions. 

A Proof of Theorem [1] 

We first prove Theorem [T] for game S^i. Then, we show that the same result holds true also for game 
A.l Game 

Given in (l9|) , according to the definition of NE (cf . Definition [1]) , the proof consists in showing that 
for each player q, given the optimal strategy profiles of the others at some NE (corresponding to the 
optimal matrices = WSy^ with = diag(pr), Vr / q) i.e., R-, = o-gl + E H^gWS^W^H^, 

the maximum of mutual information Ig(Fg,F_g) defined in d?]), under constraints (jlOp . is achieved by 
'PqFq = WEqW^, where Sg = diag(pq) and pg is solution to ([20]). for fixed p_g. 

In the absence of spectral mask constraints, the statement of the theorem comes directly from 
the well-known diagonality result of the capacity-achieving solution to the single user vector Gaussian 
channel [M], and the fact that all channel matrices in d?]) are diagonalized by the IFFT matrix W: 

log (|I + Ff Hf^RliHggFgj) = log (|I + AgQgl) < ^ log (l + [A,],, [Q,],,) , (39) 

k 

where = W^F^F^W, we have used the eigen-decomposition H^RlgH^g = WAgW-'^, with 



\Hqq{k)\ 



2 



^ + Y.r^q\Hrq{k)\ Pr{k) 



and the last inequality follows from the Hadamard's inequality [34]. Since the equality in ()39p is reached 
if and only if Qg is diagonal and the power constraint Tr (F^F^) = Tr (Qg) < Pt depends only on the 
diagonal elements of Q^, we may set Qg diagonal w.l.o.g., i.e., Qg = S^; which leads to the desired 
optimal structure for Fg. 

Interestingly, in the presence of spectral mask constraints, we can still use the previous result since 
the additional constraints [W^F^F^W]^^ = [Qgjj^.;, < p^''^^{k) depend only on the diagonal elements 
of Qg. Introducing the optimal structure F^ = WSg in ^^i, we obtain the simpler game ^ in ()20p . 



A.2 Game 



2 



The proof hinges on majorization theory for which the reader is referred to [44j or [51 [l5]. The key 
definitions and results on which the proof is based are outlined next for convenience. 



Definition 2 ([441, 1.^.1]) For any two vectors x,y G M", we say x is majorized by y or y majorizes 
X ( denoted by x ^ y or y y x) if 



^X[k] < '^y[k], l<k<n, 

k=l k=l 
n n 

k=l k=l 

where and denote the elements of x and y, respectively, in decreasing order. 

Definition 3 A real valued function (p defined on a set A C R" is said to be Schur-convex on A if 

X -< y on A =^ 0(x) < (p{y). (41) 
Similarly, (p is said to be Schur-concave on A if 

X -< y on A =^ 0(x) > (/)(y). (42) 

Lemma 1 ( [44|, p. 7], \44\. 9.B.1]) For a Hermitian matrix A and a unitary matrix U, it follows that 

1(A) ^ d(U^AU) -< A(A), (43) 

where d(A) and A(A) denote the diagonal elements and eigenvalues of A, respectively, and 1 denotes 
the vector with identical components equal to the average of the diagonal elements of A. 



Interestingly, matrix U in (143^ can always be chosen such that the diagonal elements are equal to 
one extreme or the other. To achieve d(U''^AU) = 1(A), U has to be chosen such that U^'^AU has 
equal diagonal elements; to achieve d(U^AU) = A(A), U has to be chosen to diagonalize matrix A, 
i.e., equal to the eigenvectors of A. 

As in Appendix lA.il the proof consists in showing that, given the optimal strategy profiles of the 
others at some NE, the optimal precoding matrix of user q, solution to (jlSp . is in the form of (jl9p . 
Thus, hereafter we can focus on a generic user q, and assume that the strategy profiles of the others 
are fixed and equal to = WS^^ , with — diag(pr), Vr ^ q. We prove the theorem in two steps. 
Given q, first, we show the equivalence of the original complicated problem (llSp and a simpler problem, 
and then, we solve the simple problem, implying the optimality of in the form (|19p . 

Defining = W^Fg, the MSE matrix of user q can be written as 

E,(P^) ^ (I + Ff Hf,R-iH,,F,)-i = (I + Ff WA,W^F,)-i = (I + Pf A,P,)-\ (44) 

where in the second equality we used R_5 = a^gI+Yl H^gWE^W^H^ and the eigen-decomposition 

H^RIgHgg = WAgW-^, with A^ defined in ([jop . The matrix Eg(P^) has the following properties: i) 
Eq(Pg) is a continuous function^ ""I of Pg G C^^^; ii) For any unitary matrix U, Eg(Pg) satisfies 

E,(P^U) = U^E,(P^)U. (45) 



Using (j44p . the payoff function of user q (to be minimized) in (jlSp can be written as a function of 

/ f[E„(P„)l Y'-l\ 
1+- 



d(E,(P,)): 



/,(d(E,(P^)))^-r,(F,) = -i-5^1o: 



k=l 



kk 



(46) 



''This result can be proved using [46l Theorem 10.7.1]. 



V / 



kk 



where ^q{Fq) is defined in pT|) and the last equahty fohows from the relation SINRfc^g(Fg) = ^ Eq(Pg) 
—1, with SINRfc^q(Fq) defined in ([16]). The function fg has the following properties: i) /g(d(Eg(Pg))) 
is a continuous function of Pq G (^NxN (implied from the continuity of Eg(P^), as pointed out before); 
ii) /q(d(Eg(Pg))) is a Schur-concave function on (cf. Definitional) [S]- 
Using (j44p and (j46p . the optimization problem of user q in (jlSp becomes 

minimize /g(d(Eg(P ))) 



P 



Ijr (P„Pf ) < Pq, (PI) 

subject to N I 9 9 ^ - <?' 

d (P,Pf ) < p--, 

where p^'^^ = {p^'^^{k))^^-^^. Observe that problem PI always admits a solution, since the feasible set is 
closed and bounded (thus compact) and the objective function is continuous in its interior (as pointed 
out before). A priori the solution is unknown, but we assume it given by an oracle and denoted bjQ 
P*. As it will be shown next, we do not need to know explicitly such a solution to complete the proof 
of the theorem. Then, problem PI is equivalent to 



mimmize /q(d(Eq(Pg))) 



P 



subject to d (PgPf ) = d*, 



(P2) 



where d* = d (P*P*^) . The equivalence of both problems is in the following sense: 1) The optimal 
(and thus feasible) point P* of PI is feasible in P2 with the same value of the objective function; 2) A 
feasible point in P2 (not necessarily optimal) is feasible also in PI with the same value of the objective 
function. 

Thus, for any given d*, we can focus on solving problem P2 w.l.o.g., and show that the op- 
timal solution to P2 is diagonal; which leads to the desired structure for the original matrix Fq 
in ([TH]) . Since (d (Eg(Pg))) is Schur-concave, it follows from Definition [3] and Lemma [1] that 
/q (d (Eq(Pq))) > (A (Eq(Pq))) . Interestingly, for any given Pg, it is always possible to achieve 
the lower bound fg (A(Eq(Pg))) of fg (d(Eq(Pq))) by using instead the matrix Pg = PgU such that 
Eq(Pg) = U''^Eg(Pg)U is diagonal (see ()15]) 1. without affecting the constraints, since d(PqP^) = 
d (PgPf ) . In such a case, fg{d(Eq{P q))) = /g(A(E^(Pg))) = fg (d(Eg(P5))) . This implies that there 
is an optimal Pg for problem P2 such that Eq(Pg) is diagonal or, equivalently from (pi|) . such that 
P^AqPg is diagonal. 

Now, given that P^AgPg is a diagonal matrix, let us say Xlg, i.e., P^AgP^ = Hg, we can write 



]^/2 1/2 t 

Pg as Pq = Aq UgXlg w.l.o.g. (see, e.g., [UllS]), where \Jq is any unitary matrix. Using such 



a 



structure of Pq, problem PI can be equivalently written as 

N 



minimize 



k=i ^ ^1 ^ 



(47) 



subject to d (UgE.Uf ) = d, 



where d^ = Agd*. 



From majorization theory (see [441 5. A. 9. A and 9.B.2] or [451 Lemma 4 and Lemma 3]) we know 
that, for any given S^, we can always find a unitary matrix Ug satisfying the constraint in (j47p if and 
only if 



In the case of multiple solutions, we may choose one of them, w.l.o.g.. 



A(S,) y dg. (48) 

Therefore, we can first find the optimal Sq in (|47p replacing the original constraint with (j48p and then 
choose the matrix Ug to satisfy the constraint in (j47p with that optimal Xlg. Therefore, we have 

mimmize -|:iog,(l+^) ^^^^ 

subject to d(Sq)^d^. 

Since the objective function in ([49]) is Schur-convex, it follows from Definition [3] that the optimal 
solution to (j49p is d (5]*) = d^. Given such a 5]*, the constraint in (j47p is satisfied if the matrix 
Uq= I, implying for in problem P2 the optimal structure Pq = Aq ; which leads to the 

desired expression for = WPg = WAg ""^^^S^^ = Wy^ diag(pg). This proves that all the solutions 
to problem PI can be obtained using Fq = Wy^ diag(pq). ■ 

B Proof of Theorem [2] 

B.l Existence of a NE 

First, we show the existence of NE for the game in (I20p using the following fundamental game theory 
result: 

Theorem 3 1141 HE/ The strategic noncooperative game ^ = {17, {A'^lggQ, {<I>q}qgf2} admits at least 
one NE if, for all q E O; 1) The set Xq is a nonempty compact convex subset of a Euclidean spacJ^. 
and 2) The payoff function '&g(x) is continuous on X and quasi- concav^^ on Xq. 

The game ^ in (j20p always admits at least one NE, because it satisfies conditions required by Theorem 
[3l 1) The set !^q of feasible strategy profiles, given by (f22|) . is convex and compact (because it is closed 
and bounded); 2) The payoff function of each player q in (|2Up is continuous in p and (strict) concave 
in pq G ^q for any given p_g (this follows from the concavity of the log function). Hence, it is also 
quasi-concave [33]. 



B.2 Uniqueness of the NE 

To prove the uniqueness we need the following intermediate results. 

Definition 4 A matrix A G M"^" is said to he a Z-matrix if its off-diagonal entries are all non- 
positive. A matrix A G M"^" is said to be a P-matrix if all its principal minors are positive. A 
Zi-matrix which is also P is called a Ji-matrix. 

Many equivalent characterizations for the above classes of matrices can be given. The interested 
reader is referred to [48 j [59] for more details. Here we focus on the following two properties. 



Lemma 2 ( [48', Theorem 3.3.4]) A matrix A G M"^" is a P-matrix if and only if A does not 
reverse the sign of any nonzero vector, i.e., 

Xi [Ax] . <0 for alii x = 0. (50) 



^^A subset Xq of a Euclidean space is compact if and only if it is closed and bounded. The set Xq is convex if 
X = ex'"' + (1 - 6l)x(i) e Xq, Vx<°',x(^) G Xg and 6 G [0, 1]. 

^"A function / : "i^ C R"h-*R is called quasiconvex if its domain 'rf and all its sublevel sets = £'rf \ f (x) < a}, 
for Q G R, are convex. A function / is quasi-concave if — / is quasi-convex. 



Lemma 3 ( [48', Lemma 5.3.14]) Let A G M"^" he a 'K-matrix and B a nonnegative matrix. Then 
p{A^^'B) < 1 if and only i/ A — B is a K-matrix. 

We provide now sufficient conditions for the uniqueness of the NE. First, we derive a necessary 
condition for two admissible strategy profiles (whose existence is guaranteed by the first part of the 
Theorem) to be different Nash equilibria of the game in (j20p . Then we obtain a simple sufficient 
condition that guarantees the previous condition is not satisfied; hence guaranteeing that there cannot 
be two different Nash equilibria. 

Assume that the game ^ admits two distinct NE points, denoted by p^'''',p^^^, where p^^^ = 
[pi''^'^,... ,pj^'^]^ and p^*"^ = \p'ij'\l),... ,p[''\N)f, with r = 0,1, q £ Then, they must satisfy 
the following (necessary and sufficient) Karush-Kuhn- Tucker (KKT) conditions (see, e.g., [33|) for each 

q e 27V+1 

Vp,/2, (p(^)) + E /^SVpJ,,,(pf)) = 0, 



k=l 



r = 0,1, 



A^J:i/,,A:(prO = 0, M$ > 0, UkiP^'-') > 



where /x^''^ = [fJ^^gl, ■ ■ ■ ^^J'q2N+l\'^ nonnegative Lagrange multiplier vector, and 



(51) 



[Vp,i?,(p) 



iil< k<N, 
ff k = N + l, 



p^^'^ik -N-l)-p^''\k-N-l), if N + 1< k <2N + 1, 



log e 



HqqikW 



TgiV 1 + E?^«,.=i l^..(A;)l'pr(fe) + r,-i |//,,(A;)|2p,(A;) 
log e 



\Hqqik)\' 



(52) 



(r) 

with 5rq defined as 6rq = 1 ii r = q and 6rq = otherwise. Note that the functions fq,k{Pq ) are (linear) 
concave, thus it must be that, for each and [33] 

4fc(pS'))</g,fc(pf) + (pS'^-pS°0^Vp, /,,fc(pf), VA;G{l,...,2iV+l}, V<7 G f). (53) 



Multiplying the first equation of (j51|) by ^Pg^^ — Pq'^^^ for r = 0, by (^Pq^^ — Pg^^^ for r = 1, and 
summing over r we obtain, for each q £ Q : 



p^i' - vp,i?, (p(o)) + (pr - pi^T vp,^. (p«) + 



E 

fe=i 



(pS^^ - pS°f vp,/..(pr) (pf^ - P^^o' vp,/..(pS^)) 



The second term in (j54p is clearly nonnegative since it is lower bounded by 

2Af+l 



k=l 
2N+1 



(0)^ 

q > 



> 0, 



A:=l 



(54) 



(55) 



where we have used the (linearity) concavity of fq^^ a-nd the constraints P'^q\fq^k{Pq') = 0. 



It is now evident that if, for some q, the first term in (|54p is strictly positive, then p^'^^ and p^^^ 
cannot be both NE points, as (|54p would not hold. We will next obtain a simple sufficient condition 
for this term to be indeed positive for different points p^''^ and p^^^; in other words, to guarantee that 
there cannot be two different NE solutions. 

The positivity condition for the first term in (j54p is, for some q 

(p«-pr)^(Vp,i?,(p(°))-Vp,i?,(p«)) 

k&Vg \ r=l / 

(56) 



where (k,p^^\p^-^^] 



loge 



1 2 JO), 



|2jl)/ 



with 



aq {k, P*'"'^'*) > 0, and p*-*^-* ^ p^^^. In (f56l) we have used the fact that, outside Vq, we have p'q\k) 
p^q\k) = 0, where Vq is defined in (i27|) . 

Define ICq as the set of carriers in Vq such that the two solutions coincide, i.e., for user q: 



lCq^[keVq\p^l\k)=pf{k)]. 



(57) 



Observe that it cannot be that KLq = Vq for all q, since p^'^^ and p^^^ are different solutions by as- 
sumption. From (j56p it follows that p^^^ and p*-^^ cannot be both NE points if the following sufficient 
condition is satisfied: 



\Hqq{k)\'' (pj^) (A:) - {k)) Tq ^'■'\Hrq{k)\'' [p^'^k) - pf\k)) > 0, Vfc G and some q E Q. 

r=l 

(58) 

Since in ([58]) k G Vq\ICq, it follows that p^^^ (A;) — pq^^ (k) / 0. The sufficient condition is then, 
V/c G Vq\ICq and some g G 17, 

-|/f,,(fe)|2|p«(A;)-pf (fc)|+sign(p«(A;)-p(0)(fc)) ^ |/?,,(A;)|2 (p«(A;) - p(o)(A;)) > 0, (59) 

^ r^q,r=l 

where sign (•) is the sign function, defined as sign (x) = 1 if x > 0, sign (x) = if x = 0, and 
sign (x) = — 1 if X < 0. Using the fact that pi^^ (k) — p^^ (k) = whenever k ^ V^, condition ()59p can 
be equivalently rewritten as, V/c € Vq\K,q and some g G il, 



p'i\k)-pf\k) +^ign(p^^\k)-p^^\k)) Y Grq{k)[pi'\k)-pi'\k))>0, (60) 



with 



Grqik) 



\Hrq{k)\'^ dqq P^. 
\Hqq{k)\'^ d!^q Pq 



, ifk£Vr, 
otherwise. 



(61) 



A more stringent sufficient condition than (j60p is obtained by considering the worst possible case, i.e., 
when the second term in ()60p is as negative as possible: 



Ag(A;) > Y^ Grq{k)Ar{k), Mk G Vq\JCq and some g G il, 

r^q,r=l 



(62) 



where Ag(A;) is defined as 

A,(A;)^ /;\k)-pf{k) . (63) 
Note that, since p^'^^ 7^ p^"^^ by assumption, it must be 

Aq{k) / 0, yke VgXlCq and some q £ n. (64) 

Thus far, we have that condition (j62p can not be satisfied by the two different NE points p^'^^ and 
p(^). This means that the following condition needs to be satisfied by such p^*^) and p^^^^: 

Q 

Ag(/cg) < Grq{kq)Ar{kq), ioT souie kg ^ Vq\JCq and \/q & Q, (65) 

ry^q,r=l 

where for the sake of notation we denoted by kq any subcarrier index in the set Vq\lCq. Introducing 

Grq{k) = < _ (66) 

0, otherwise, 

and using the fact that each Ag(fc) = whenever k ^ Dg and k £ ICg, condition (j65p becomes 

Q 

Ag(/cg) < ^ Grq{kq)Ar{kg), for some fcg G Pg\/Cq and G 0. (67) 

rj^q,r=l 

We rewrite now condition (|67p in a vector form. To this end, we introduce the A^Q-length vector 
A = [A^(l),...,A^(iV)]^, with A{k) = [Ai(A;),...,AQ(A;)]^and the matrices H G M^«x^« and 
H(A:) G M^'''^ defined as 

H-cliag({HW}M, and | W. if ^ = and . ^ ^^^^ 

^ ^ I U, otherwise. 

Using (f68|l . condition ([67ll can be equivalently rewritten as 

(I - H) A < 0. (69) 



The proof will be completed by showing that condition (jCip is enough to guarantee that (j69p cannot 
be satisfied by two different solutions (implying then that (j62p is satisfied and that the two different 
solutions are not NE points). Since A > 0, inequality ([69]) implies 



A, [(I-H) A]. <0, yi = l,...,NQ, (70) 

where Aj denotes the i-th component of A. It follows from Lemma [2] that, if 

I — H is a P-matrix, (71) 

inequality ()70p provides Aq(/c) = 0, for all g G i7 and k G {!,..., A^}; which contradicts the initial 
assumption that p^'^^ and p*^^^ are two different points (see (j64p ). Therefore, condition (j7ip is sufficient 
to guarantee the uniqueness of the NE. 

We complete the proof showing that condition (ICip implies (I7ip . We first introduce the matrices 
H G M^^^^^ and H(fc) G R^""^ defined as 

H 4 diag (mk)}Q , and [H(^)]^, 4 ( J^-^^^' [ + ^' (72) 
^ ' U, otherwise. 



Observe that H(A;) in (j72p coincides with the matrix defined in (|26p . Since I — H and I — H are 
Z-matrices (cf. Definition H]) and I — H > I — H, where ">" is intended component- wise, a sufficient 
condition for (j7ip is the following |48j 

I — H is a P-matrix. (73) 

Given (j73p . I — H is a K-matrix (cf. Definition S]) . It follows from Lemma [3] (using the corre- 
spondences: A = I and B = H) that condition (j73p is equivalent to p (H) < 1; which, exploring the 
block-diagonal structure of H in ()72p leads to condition (CI). This completes the proof. 
Condition ()C2p in Corollary [1] is obtained using the following result \5U, Corollary 8.1.19] 

< H(A;) < H™^^ p (H{k)) < p (H""^^) . 

Conditions ()C3P - ()C4p in Corollary [2] can be obtained as follows. Using |50l Theorem 5.6.9] 

p{U{k))=p{U^{k))<mk)\\, (74) 

where ||-|| is any matrix norm [50], a sufficient condition for (jCip is ||H(A:)||^ < 1, with ||-||^ denoting 
the weighted block maximum norm, defined as [50] 



||H(fc)C4max-5^u;.[H(fc)],^, (75) 
and w = [wi, . . . , wq^ is any positive vector. ■ 

C Proof of Proposition [2] 

We first show that, in the case of high interference, an orthogonal NE always exists. Then, we prove that 
if an orthogonal NE exists and condition (|3ip is satisfied, at the equilibrium, the available subcarriers 
must be distributed among the users according to (j32|) . 

To prove that an orthogonal NE always exists in the high interference environment, it is sufficient 
to show that, at such a point, all the users do not get any rate increase in sharing some subcarriers. 
The power distribution of each user q at any orthogonal NE must satisfy the single-user waterfilling 
solution over the subcarriers that are in Xg, i.e., 



Smq\Hqq{k)\ q ^ ^l. (76) 

0, otherwise. 



It is straightforward to check that a power distribution as in (|76p always exists, provided that 
Q < N. For example, in the simple case of Q = N, there are Q\ different partitions Xi, X2, . . . ,Xq of 
the set {1, . . . , N}, corresponding to all the permutations where every user uses only one carrier; which 
guarantees ([76]) to be satisfied. 

Observe that a strategy profile as in (j76p is not necessarily a NE. However, for any given {snrgj^gQ, 
there exists a set {inr^^jr^ggn, with each inr*^ >> 1, such that, for all inr^g > inr*^, 

+ Pq < r„ '■ ' " ' i9 ' ' ' ' 1 S Ir, kg G Iq, '\/r ^ q £ 17, 



snr^ \Hqq{kq 



1 + inr,., 1 


Hrq{kr)\ 


Pr{kr) 


snrg 


\Hqq{kr , 


j2 



which is sufficient for q^-th user to allocate no power over any subcarrier in X^. Thus, every user q, given 
the power distribution of the others, does not have any incentive to change its own power allocation. 
Hence, the strategy profile given by ([76]) is a NE. 

We assume now that an orthogonal NE exists and that condition (j3ip is satisfied. At such a NE 
the KKT conditions must hold for all users. Thus, for each q and k, it must be 



loge \Hqqik)\ 



2 



^i^l + E?=i^^''"Ptik)\Htqik)\^ ^^"^ VgGf^,VA:e {!,..., iV}, (77) 

Pq{k) > 0, (1/iV) J]f^^P,(^) =h f^q> 0, 

where we used the fact that at the optimum the power constraint must be satisfied with equality and 

1/2 ~ 1/2 ~ 

Hrq{k) = inrrg Hrq{k), \/r ^ q and Hqq{k) = snr^ Hqq(k), \/q G Q,. Note that in the first expression 
equality is met if Pq{k) > 0. Now, we characterize the set Zq. Assume that kq £ Tq. Using ([771) we can 
write 

- f^q, . , < f^r, > 0, > 0, r / q. (78) 



1 + T-\ikq)\Hqq{kq)\^ 1 + Pqikq)\Hqrikq)\^ 

It follows that 

\Hrr{kq)\^^+r-%{kq)\Hqq{kq)\'' a \Hrr{kr)\^ I + Pr{kq)\Hrq{kr)\'' 



\Hqq{kq)\'^ I + Pq{kq)\H qr{kq)\'' " " \H qq{kr)\' I + p^{kr)\Hrr{kr)\'' 

Using ([3T]) we have 



\Hrr{kq)\^ 1 + T'^Pg (/Cg) | i7,,(/c,) ^ ^ \Hrr{kq)\^ \H„iK)? 1 + (fc,) | g,,(fc,) ^ ^ \Hrr{kr)[' 



\Hqq{kq)\^ I + Pq{kq)\H qr{kq)\^ " | ( fe,) | ^ ' \H qq{kr)\' I + TV^ p,{kr)\H„{kr)? ' \H qqikr)^ 

(80) 

Introducing ([80]) in ([79]) we obtain the desired relation. ■ 

D Proof of Proposition [3] 

Under the conditions \nr rq << 1 and snrg >> 1, \/q,r ^ q £ Vt, each rate i?q(p) can be approximated 
with a negligible error b}o 

^^..J T-^Snrq\Hqq{k)\^ pq{k) Gqq{k)pq{k) \ 



Vg isnrg j 


\Hqq{k)\ 




1 + E?^5,r=lin''rg| 
■q\Hqq{k)\\Grq{k) 


\Hrq{k)\ 
A ■ 


Pr{k) ^ 

\R,q{k) 



^ ^ S UEi_iin..ii^.,(.)r..(.)j ^ ^ S U 



^ Vl + I^r^,,r=lG'rg(%.(fc)y 

(81) 

3 e P ^ {x G M^^ : 1% < 1, Xq{k) < 
p^^^{k), V/c E {1, . . . , A^}, Vg G i7} . To prove the convexity of the rate region defined in ([33]) . we need 
to show that 

3 p G I af + /3f < R(p), (82) 



(83) 



whenever _ - ^ 

r < R(p), for some p G P; 

f < R(p), for some p G "P; 
and a, (3 > 0, a + (3 = 1. Starting from ([82]) and using ([83]) we have, for each q, 

+ /3f < 1 f falog ( ^'^'^(■^y''''' ^ +.10^ Gqqik)e^^i^^ \ \ 



There always exist a set of inrrq and snrq such that all the links transmit over the whole bandwidth. 



whereXg(A;) = \og^Pq{k) a.ndxg{k) = log^, Pq{k),\/q G Jl. Since the function log ^^1 + Y^rj^q^rqe^^'j Gqq^ 

with positive {Grq} , is convex in x G M'^ ( the function log ^1 + Yl^q=i ^^''^ is convex in M" |33|), ex- 
pression ([8^ can be upper-bounded by 

ar„ + irq< — } log ^ = ^g(p), (85) 

V + Er^qGrq{k)P?{k)p^r{k)J 

where p is given by Pq{k) = Pq{k)pq{k). Observe that p is feasible as p < ap -|- /Jp € 7^, where we have 
used the geometric-arithmetic inequality [17] and the fact that V is convex. ■ 

E Proof of Proposition [4] 

After showing that the (globally) optimal solution to the scalarized MOP (I35p corresponds, for any 
given A > 0, to one of the Pareto optimal points of the MOP ()34p . we prove that such a solution is a 
NE of the game ^ (A) defined in (j36p . We then show that, under conditions of Proposition O the game 
^ (A) admits a unique NE, for any given A > 0. 

Given the MOP (j34[) . consider the scalarized MOP defined in ([35]). For any given A > 0, the 
(globally) optimal solution to (|35p is a point on the trade-off surface of MOP (j34p . This follows directly 
from the definition of Pareto optimality. Using this result, we now derive the relationship between the 
scalarized MOP ([35]) (and thus the MOP ([34])) and the game #(A) defined in ([36])- Let p* be the 
solution to (|35p for a fixed A > 0. Then 

Q Q 

^XqRqip*) ^ ^ Agii,(p), Vp / p^ p„ p^ G ^q,yq G Q. (86) 

5=1 q=l 

Hence, setting p = (pg,p*g), from the above inequality it follows 

RqiPq,P-q) + ^^2 KRr{p*q, P-q) > Rq{Pq, P-q) + ^ KRr{Pq, P%) , Vg G Q, (87) 

r^q r^q 

or, equivalently, 

$9(p*, P* ,) > ^q{Pq, P%), yq G n. (88) 

with <I>g(pg,p_g) Is glveu by ([37]) . But ([88]) is indeed the definition of NE for the game ^ (A) given 
by (I36p . Hence, it follows that, for any given A > 0, the set of Nash equilibria of ^ (A) contains 
a Pareto optimal point of (I34p . However, the converse is, in general, not true. Observe that the 
payoff functions <I>g(pq, p_g) are not quasiconcave in pq and, hence, the classical results of game theory 
providing sufficient conditions for the existence of a NE (as, e.g.. Theorem [3] in Appendix [B]) cannot 
be used for the game ^(A). Still, the game ^ (A) admits at least one NE, since a solution to the 
scalarized MOP ([35]) always exists, for any given A (the domain x . . . x of ([35]) is compact and 
the objective function is continuous in the interior of the domain). 

We show now, that under conditions of Proposition [3l the correspondence between the set of Nash 
equilibria and the Pareto optimal points of (I34p becomes one-to-one, and, moreover, any Pareto optimal 
point is, for a proper A, the unique NE of (A). Assume that conditions of Proposition [3] hold true. 
Then, the rate region (|33p becomes convex, and thus all the points on its boundary can be achieved 
solving, for any given A > 0, the scalarized MOP in ([35]) [15j and thus, as shown above, using the game 



^ (A) . To complete the proof, we need to show that, in such a case, (A) admits a unique NE, for any 
given A > 0. 

Since the game 'S (A) satisfies the conditions of Theorem [3] (see Appendix |B|) . it is sufficient that 
the (sufficient) condition for the uniqueness of the NE (in (j54p l holds true. Replacing in (j54p the payoff 
functions -Rg(p) of ^ with the payoff functions 'I'^ (p) of 5^ (A) and summing over g, we obtain the 
following sufficient condition for the uniqueness of the NE of the game (A), Vp*^"*^^ 7^ P^'^^, Pg^'*iPg*''* £ 



E (Pj^^-Pf) Vp,$,(p(°)) + (pf -p«) Vp,$,(p« 
9=1 



> 0. (89) 



Under Proposition [3l (p<j,P-q) is given by ([37|) . where each i?^ (Pg^P-g) can be approximated with 
a negligible error by ([8T]) . Hence, each $q (pq,p_q) becomes a strict concave function with respect to 
Pq, for any given p_q and A > 0, and thus it satisfies the following inequality [33] 

- Vf)^^.q\ (pf ,PL°]) > $,(p«,pL°i) - $,(pf ,pL°]). (90) 

Using (jSip and (j90p we find that condition (j89p is satisfied. Hence the game (A) admits, for any 
A > 0, a unique NE that corresponds to one of the Pareto optimal points of the MOP (j34p . ■ 



F Proof of Proposition [5] 

The upper bound of Rq{p*, p* g) in (138]) comes directly from (f37ll . Thus we prove only the lower bound 
of Rg{pq,p*_q). To this end, we use [371, Corollary 37.6.2] and [171 Lemma 36.2]. 
Starting from the definition of NE, at any NE p* we have that 

^g(Pg,P-g) = max i?g(p„p* )> miri max^ Rq{pq,p_q). (91) 

Inequality (|38p comes directly from (|9ip interchanging the order of the supremum and the infimum 
operators. To this end, it is sufficient that, for each q, the functions Rq{pq,p-q) in (j2ip and the sets 
^q, defined in ([22]) . and = ,^1 x . . . x ^q-i x =^^+1 x . . . x ^q, satisfy conditions required by 
[471 Corollary 37.6.2] and [47\ Lemma 36.2]. The set ^1 x . . . x is a closed bounded non-empty set 
and thus conditions in [47t Corollary 37.6.2] are satisfied. Each function Rq{pq,p_q) is strictly concave 
with respect to Pq for any given p_g G -^-q and convex with respect to p_q for any given Pq G ^q. 
Hence each i?g(pg,p_q) is a concave-convex function on ^q x ^-q. ■ 
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